CDS
Accession Number | TCMCG075C03227 |
gbkey | CDS |
Protein Id | XP_017973629.1 |
Location | join(31283918..31285701,31289764..31289840,31290263..31290691,31290848..31291023,31291335..31291453,31291535..31291695,31292169..31292302,31292587..31292803,31292893..31293022,31293470..31293767) |
Gene | LOC18613571 |
GeneID | 18613571 |
Organism | Theobroma cacao |
Protein
Length | 1174 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018118140.1 |
Definition | PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC18613571 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGATTGAAATTAAATCAAAACTTACGGTTGCTTTTTTAAGAGGCTACTACATAAAGCTAGCAAAATCCTAGCAAGCAATGCGAGCTTCTGCAAGGCACATCTCCCCCACTCATCCAACTCTTCATTTTGCACACGCGACCCTTGAATCCTTCATCTGGAACACCCTCATACGAGGCAATGCTCAGGCTACAGCTAGACCCACGTTCCCCACAATCTCCATCTACATTCGTATGCGCTTCCATGGCGTTTCACCTGATTTCCATACTTTCCCATTCCTCCTGCAATCATTCAATTCCTCATTTCACCTTCTCTCAGGTAAACAAATCCATGCCCAAACTATCCTCTTCGGCCTCGTCCAAGACCCATTTGTCCAAACGTCACTCATCAACATGTACTCATCTTGTGGCGACTTAATTGTTTCCCGTCAAGCTTTCGACGAGATCACCCAACCTGACGTAGCGTCTTGGAATTCAATTATTCATGCCTATGTTAAAGTGGGGTTGATCGATTTAGCACGGGGTTTATTCGATAAAATGCCGGAGAGAAATGTGAGATCTTGGAGTAGCTTGATAAATGGGTTTGTGAGGTGTGGGAAATATAAGGAAGCGCTTGCTTTGTTCCGTGAAATGCAAATGTTGGCAGTTAACGATGTTAGGCCAAATGAATTTACAATGTCTGCTGTACTTTCTGCTTGTGGGCGTTTGGGTGCTCTGGAGCATGGGAAATGGGCTCATGCTTACATTGACAAATGTGGGATAAAAATTGATGTTGTCTTGGGAACTTCTTTGATAGACATGTACGGAAAATGTGGGAGTATTGAAAAGGCGAGGGATGTGTTTAGTAATTTAGGTCCTGATAAAGATGTGCTGGCTTGGAGTGCTATGATTTCTGGCCTGGCTATGCATGGCCATGGTGATGAATGCCTTAAATTGTTTTCGGAGATGATAAAACGACAGGTGAGGCCGAATGCTGTAACATTTTTGGGTGTACTTTGTGCTTGTGTACATGGAGGTTTGGTGAACGACGGAAAGGAGTATTTTCGGAGGATGAGCAAGGAGTTTGGTATCATTCCTTCGATACAGCACTTCGGTGCCATGGTTGACCTTTATGGAAGAGCTGGTCTCATTGATGAAGCATGGAATGTGGTTAAATCTATGCCTATGGAGCCTGATGTGCTCGTATGGGGGTCCCTGTTGAGTGGATCCAGGATGTGTGGGAACATTGAAACATGCGAGGTTGCACTCAGGAAGTTAATTGAGTTAGATCCCACAAATAGTGGTGCATATGTGCTCCTTTCAAATGTGTACGCAAAGACGGGCAGATGGACAGAAGTAAGGCGTGTGAGAGACGTCATGGAGGGTAGGGGGATTAAGAAGGTTCCTGGATGTAGTTTGGTTGAGATGGGAGGTGTCCTCCACGAGTTTTTTGTGGGAGATGATTCTCTTCCAGAGAGCAGAGAGATATACATGATGCTTGATGAGATTATGAAGAGATTGAAGTTGGAGGGTTATGTGGGAAACACCAAGGAGGTTTTGCTTGATTTGGATGAGGAAGGAAAGGAGTTGGCACTATCCCTTCATAGTGAAAAGATAGCTATTGCCTTTTGCTTCTTAAAGACAAGTCCAGGTATTCCTATTCGCATCATCAAGAACCTTAGGATATGTCTTGATTGTCATGTTGCAATAAAAATGATATCTAAGGTGTTTGCCCGAGAGATTGTTATTAGAGACTGCAATAGGTTTCACCATTTTAGAGAGGGAGCATGTTCTTGTGAAGATTACTGTTGGAAGCATCACCAGCCGTCGCCGGGGCTCCGCCTTTGTCGGTGGAGAGCTCGGGCGATCCTACGTGTCTTTACTGGATGTTTGAACAGTGAAGGGAGGCTGATTAAAATGGTCTTAGTACGTAAGCAAGCTGCTGATGACAGCATGTTGGGGATTTCCAGTATTTTGGAAGATGAAAAATGTGAAAGTTTTTTGCTTTTTTATGGTGAA
ACTCTGCTTGGGAATGGGTTACGAGCCTTCCTGTATTTTCTTGGCCTTGCTTATTGCTTCATTGGGTTGTCAGCCATAACTGCTCGTTTCTTCCGATCAATGGAGAATGTTGTGAAACACACACGTACAGTTGTAGAGATAGATCCTGTCACTAATACTGAAATTTATAGACAAGAAAAGGTGTGGAATTATACTATTGCAGACATTACTTTGCTTGCCTTTGGGACTAGCTTCCCTCAAATATCTTTAGCTACTATTGATGCCATTCGAAACATTGGGGATCTGTATGCTGGAGGTTTAGGTCCAGGAACTCTTGTTGGTTCGGCTGCATTCGACCTTTTCCCCATCCATGCTGTCTGTGTTGTAGTTCCAAAAGCTGGAGAGTTGAAAAAAATATCTGATATAGGTGTTTGGCTAGTGGAGCTCTTTTGGTCCTTTTGGGCTTATGCTTGGCTGTACATAATTTTAGAGGTATGGACACCAAAAGTGGTTACGCTTTGGGAGGCCTTATTGACAGTACTGATGTACGGACTACTGCTGACTCATGCATATGCTCAAGACAAACGTTGGCCTTATATATCTCTGCCCATAGAGAGAACTGAGAGGCCTGAGGACTGGGTGCCAGCAGAGGTTGCCTCAGTTAAACATGAGGGTGATGCCTGTGATGGATACTCTGAGATACTTCCAGTTGAAGAAAATGAAGGGAAGGACACTGTTGATATCTTTTCCTTTCATTCAGAAATTGGGGCAGGCTCTTCTTATCAAAAGGTGTCCACAATCGAAGATTTAGTCGAAACCTCGACTAGACCCTTTCAGAAGGAGATCGATTTAGAAGATCCTCATGTGCTTGAACTTTGGAAAACACAATTCCTGGATGCACTCACATTGGAAAGCCCTGAATCAAGAAAACTGAACAACATCCATCTTCGGCTAGCAAGTATTGTTTGGCAGTCACTACTTGCACCCTGGAGAGTTCTGTTTGCCATTGTGCCTCCTTATCAAATTGCTCATGGATGGATTGCCTTCATTTGCTCTCTACTTTTCATCAGTGGGATAGCTTACATTGTAACAGAGCTGACGGATCTTATAAGCTGTGTCACAGGGATAAATGCTTATGTCATAGCGTTTACAGCATTAGCTGCTGGCACTTCATGGCCAGATTTAGTAGCAAGTAAGATTGCTGCTGAACGTCAGATAACAGCTGACTCTGCTATAGCAAACATTACTTGCAGCAATTCAGTGAATATTTATGTGGGCATTGGCATTCCGTGGCTGATTGATACTGCATACAATTTCATAGCATATAGAGAACCATTGCGGATACAAAATGCAGGGGGACTCAGCTTCTCTCTGCTTGTATTCTTCTCCACCTCTGTAGGCTGTATTTTGGTATTGGTGATTAGGCGCCTGACACTGGGGGCGGAGCTCGGAGGCCCTAGGATTTGGGCCTGGGTTACCAGTGTATTCTTCATGTTGCTTTGGATTATATTTGTGGTCCTCTCTTCCCTTCGAGTTTCTGGCATCATATGA |
Protein: MIEIKSKLTVAFLRGYYIKLAKSXQAMRASARHISPTHPTLHFAHATLESFIWNTLIRGNAQATARPTFPTISIYIRMRFHGVSPDFHTFPFLLQSFNSSFHLLSGKQIHAQTILFGLVQDPFVQTSLINMYSSCGDLIVSRQAFDEITQPDVASWNSIIHAYVKVGLIDLARGLFDKMPERNVRSWSSLINGFVRCGKYKEALALFREMQMLAVNDVRPNEFTMSAVLSACGRLGALEHGKWAHAYIDKCGIKIDVVLGTSLIDMYGKCGSIEKARDVFSNLGPDKDVLAWSAMISGLAMHGHGDECLKLFSEMIKRQVRPNAVTFLGVLCACVHGGLVNDGKEYFRRMSKEFGIIPSIQHFGAMVDLYGRAGLIDEAWNVVKSMPMEPDVLVWGSLLSGSRMCGNIETCEVALRKLIELDPTNSGAYVLLSNVYAKTGRWTEVRRVRDVMEGRGIKKVPGCSLVEMGGVLHEFFVGDDSLPESREIYMMLDEIMKRLKLEGYVGNTKEVLLDLDEEGKELALSLHSEKIAIAFCFLKTSPGIPIRIIKNLRICLDCHVAIKMISKVFAREIVIRDCNRFHHFREGACSCEDYCWKHHQPSPGLRLCRWRARAILRVFTGCLNSEGRLIKMVLVRKQAADDSMLGISSILEDEKCESFLLFYGETLLGNGLRAFLYFLGLAYCFIGLSAITARFFRSMENVVKHTRTVVEIDPVTNTEIYRQEKVWNYTIADITLLAFGTSFPQISLATIDAIRNIGDLYAGGLGPGTLVGSAAFDLFPIHAVCVVVPKAGELKKISDIGVWLVELFWSFWAYAWLYIILEVWTPKVVTLWEALLTVLMYGLLLTHAYAQDKRWPYISLPIERTERPEDWVPAEVASVKHEGDACDGYSEILPVEENEGKDTVDIFSFHSEIGAGSSYQKVSTIEDLVETSTRPFQKEIDLEDPHVLELWKTQFLDALTLESPESRKLNNIHLRLASIVWQSLLAPWRVLFAIVPPYQIAHGWIAFICSLLFISGIAYIVTELTDLISCVTGINAYVIAFTALAAGTSWPDLVASKIAAERQITADSAIANITCSNSVNIYVGIGIPWLIDTAYNFIAYREPLRIQNAGGLSFSLLVFFSTSVGCILVLVIRRLTLGAELGGPRIWAWVTSVFFMLLWIIFVVLSSLRVSGII |
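The Location field of the CDS entry above lists ten coordinate segments inside a join(...) operator. As a consistency check, the sketch below parses that string and sums the segment lengths. It assumes GenBank-style 1-based inclusive coordinates on the forward strand, with no complement(...) wrapper and no partial-end markers. The helper names parse_join and cds_length are hypothetical and are not part of any database tooling.

```python
import re

# Location string copied verbatim from the CDS record.
LOCATION = (
    "join(31283918..31285701,31289764..31289840,31290263..31290691,"
    "31290848..31291023,31291335..31291453,31291535..31291695,"
    "31292169..31292302,31292587..31292803,31292893..31293022,"
    "31293470..31293767)"
)


def parse_join(location):
    """Return (start, end) pairs for every 'start..end' segment in the string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(segments):
    """Sum segment lengths, treating both endpoints as inclusive (assumption)."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total_nt = cds_length(segments)

# A complete CDS is a whole number of codons; the last codon is the stop.
residues = total_nt // 3 - 1

print(f"segments: {len(segments)}")
print(f"coding nucleotides: {total_nt}")
print(f"residues excluding stop codon: {residues}")
```

Under these assumptions the ten segments total 3525 nucleotides, that is 1175 codons including the terminal stop, which agrees with the listed protein length of 1174 aa.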